Integrating Verbal Idioms into an NLP System

Authors

  • Jorge Baptista
  • Nuno J. Mamede
  • Ilia Markov
Abstract

This paper describes the integration of verbal idioms into a Natural Language Processing (NLP) system. It adopts a construction approach based on the prior parsing stage, so that these multiword expressions (MWEs) can be taken into account in subsequent tasks, such as semantic role labeling or whole-part relation extraction. The paper focuses on body-part nouns, which occur in many verbal idioms, and uses a manually annotated corpus to evaluate its parsing strategy. Results showed a precision of 0.92, a recall of 0.83, an F-measure of 0.87, and an accuracy of 0.99.
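The reported figures can be cross-checked against one another. The sketch below assumes that the F-measure in the abstract is the balanced F1 score (the harmonic mean of precision and recall); the abstract does not state which variant was used, and the function name `f_measure` is only illustrative.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """F-beta score; beta=1 gives the balanced harmonic mean (F1).

    Assumption: the abstract's "f-measure" is F1. The source does not say so.
    """
    if precision == 0 and recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Figures quoted in the abstract.
reported_precision = 0.92
reported_recall = 0.83

f1 = f_measure(reported_precision, reported_recall)
print(round(f1, 2))  # 0.87
```

Under that assumption, the precision and recall quoted above are consistent with the reported F-measure of 0.87.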

Similar resources

COST Action IC1207 PARSEME meeting

Dealing with idioms in Natural Language Processing systems is difficult, among other reasons, because their architecture must be conceived in such a way that it does not preclude the processing of both free word combinations and these more constrained expressions. On the other hand, many idioms do have syntactic structure and can undergo several types of formal variation, thus making them h...

On the Effects of Linguistic, Verbal, and Visual Mnemonics on Idioms Learning

Finding more effective ways of teaching second language idioms has been a long-standing concern of many teaching practitioners and researchers. This study was an endeavor to explore the effects of three linguistic mnemonic devices (etymological elaboration, keyword method, and translation) on EFL learners' recognition and recall of English idioms. To achieve the purpose of the study, ninety male...

Reusable Lexical Representations for Idioms

In this paper I introduce (1) a technically simple and highly theory-independent way for lexically representing flexible idiomatic expressions, and (2) a procedure to incorporate these lexical representations in a wide variety of NLP systems. The method is based on Structural EQuivalence Classes for Idioms and therefore called the SEQCI method. I illustrate the approach using the Rosetta MT sys...

Implementing European Portuguese Verbal Idioms in a Natural Language Processing System

This paper is based on an extant lexicon-grammar of European Portuguese verbal idioms (e.g., deitar mãos à obra, literally 'to throw hands to the work', i.e. 'to start working'). This is a database containing about 2,400 expressions, along with all relevant information on the sentence structure, distributional constraints and transformational properties of these frozen sentences. In this paper, we pr...

Elaborating the parameterized Equivalence Class Method for Dutch

This paper discusses the parameterized Equivalence Class Method for Dutch, an approach developed to incorporate standard lexical representations for Dutch idioms into the representations required by any specific NLP system with as little manual work as possible. The purpose of the paper is to give an overview of parameters applicable to Dutch, which are determined by examining a large set of data ...

Publication date: 2014